How the No-Cloning Theorem Got its Name 
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Abstract 

I was the referee who approved the publication of Nick Herbert's FLASH paper, knowing perfectly 
well that it was wrong. I explain why my decision was the correct one, and I briefly review the progress 
to which it led. 

> 

The no-cloning theorem CD, || is of fundamental importance in quantum theory. It asserts that no 
quantum amplifier can duplicate accurately two or more nonorthogonal quantum states. A simple proof 
requires only a few lines || . Why wasn't that theorem discovered fifty years earlier? What are the events 
that actually led to its discovery and publication? 

This is the story of my own personal contribution to the no-cloning theorem, made public for the first 
time after more than twenty years. Early in 1981, the editor of Foundations of Physics asked me to be 
a referee for a manuscript by Nick Herbert, with title "FLASH — A superluminal communicator based 
f> I upon a new kind of measurement." It was obvious to me that the paper could not be correct, because it 

violated the special theory of relativity. However I was sure this was also obvious to the author. Anyway, 
nothing in the argument had any relation to relativity, so that the error had to be elsewhere. 

Herbert's apparatus was an idealized laser gain tube which would have macroscopically distinguishable 
q-( outputs when the input was a single arbitrarily polarized photon. Indeed, the word LASER is an acronym 

for "Light Amplification by Stimulated Emission of Radiation." However, besides the stimulated emission, 
there is also spontaneous emission, which results in noise. Herbert's claim was that the noise would not 
prevent identifying the polarization of the incoming photon, at least statistically. For this, he used the 
notion of quantum compounds and many properties of laser physics with which I was not familiar. 

I recommended to the editor of Foundations of Physics that this paper be published 0]. I wrote 
that it was obviously wrong, but I expected that it would elicit considerable interest and that finding 
the error would lead to significant progress in our understanding of physics. Soon afterwards, Wootters 
and Zurek |l]] and Dieks Q published, almost simultaneously, their versions of the no-cloning theorem. 
The tantalizing title "A single quantum cannot be cloned" was contributed by John Wheeler. How the 
present paper got its name is another story ||. 

There was another referee, GianCarlo Ghirardi, who recommended to reject Herbert's paper. His 
anonymous referee's report contained an argument which was a special case of the theorem in references 
jl], |1. Perhaps Ghirardi thought that his objections were so obvious that they did not deserve to be 
published in the form of an article (he did publish them the following year j7j). Other objections were 
raised by Glauber and then by many other authors whom I am unable to cite, because of space 
limitations. 

With some hindsight, it is now clear that the no-cloning interdiction was implicitly used by Stephen 
Wiesner in his seminal paper Conjugate Coding which was submitted circa 1970 to IEEE Transactions 
on Information Theory, and promptly rejected because it was written in a jargon incomprehensible to 
computer scientists (this actually was a paper about physics, but it had been submitted to a computer 
science journal). Wiesner 's work was finally published in its original form in 1983 || in the newsletter 
of ACM SIGACT (Association for Computing Machinery, Special Interest Group in Algorithms and 
Computation Theory). Another early article, Unforgeable Subway Tokens [jlpj] , also tacitly assumes that 
exact duplication of a quantum state is impossible. As it often happens in science, these things were well 
known to those who know things well. 



Anyway, the no-cloning theorem was no justification for rejecting Herbert's paper. His manuscript 
mentioned exact duplication as a theoretical ideal, but it was clear that what was actually produced by 
the laser gain tube was a messy state, where many different outputs were entangled with the laser final 
state. 

The no-cloning theorem and its practical implications for quantum cryptography led to an enormous 
wave of interest, as I had predicted. Every month one or more papers on this subject are put in the 
quant-ph electronic archive or appear in other media. Typical questions that are discussed are how to 
achieve optimal, if not perfect cloning a question which led to the important notion of quantum 
disentanglement fl^| ; the no-broadcasting theorem [fl3[ , which is a generalization of no-cloning to general 
impure density matrices; M — > N probabilistic cloning jli , 15 ; rigorous and stronger theorems Jig , [l7| , |l8f ; 
and in particular the relation of optimal cloning to the impossibility of signaling and bounds 

on state estimation that are imposed by the no signaling condition p^| . The problem was finally laid to 
rest by a fundamental result: completely positive maps, the only ones that can be realized by quantum 
mechanical systems, cannot increase distinguishability pq] . 

There is no doubt that the term "superluminal" was one of the causes this subject became so attractive. 
Who would not be happy to beat the relativistic limit on the speed of transmission of information? 
Actually, superluminal group velocities have been observed in barrier tunneling in condensed matter 
p4 [25[ |. However, special relativity does not forbid the group velocity to exceed c. It is the front 
velocity of a wave packet that is the relevant criterion for signal transmission, and the front velocity 
never exceeds c. 

Clearly, the term superluminal has no room in the present considerations. The speed of light never 
enters in them, nor does the theory of relativity. The issue simply is: given two observers with an 
entangled bipartite quantum system (or even numerous such systems: they are equivalent to a single, 
bigger, physical system) it is impossible, by means of local quantum operations (LQO), to transmit 
any information whatsoever from one observer to the other, without transmitting real material objects 
between them. 

This is what quantum mechanics says |Pq| , and a speculative question could be: what kinds of 
modifications of quantum mechanics would allow information transfer by means of LQOs. In the following, 
I'll show that a nonlinear evolution of the density matrix p would give such a result. A similar conclusion 
was also reached by Svetlichny |27| , who however only considered the special case of an initial maximally 
entangled pure state, and made further restrictive assumptions (and also used the term "superluminal" 
without ever using the theory of relativity) . 

Consider our two familiar observers, Alice and Bob, who have a bipartite quantum system in a known 
state p. They perform LQOs that are mathematically represented by positive operator valued measures 
(POVMs) with elements A? = ^ ano - S = ^ respectively. The probability for the joint result jp is 



Pjti = Tr^Trs (pAj (g> B^), (1) 

where the double trace is taken on the indices of Alice and of Bob. If Bob is not informed that Alice got 
result j, the probability that he gets p is 

^P jM = Tr(^iU (2) 

3 

where ps = Tr^(p) is Bob's reduced density matrix. This result does not depend on Alice's choice of a 
POVM. This is what quantum mechanics says, and all this is well known. 

If Bob is informed that Alice got result j, it follows from (1) that the probability that Bob has result 
// is Tr BiPjB^), where 



pj =Tr A (pA j )/p j . (3) 

Here the denominator pj = ^2 u Pj V is the probability that Alice gets result j, so that pj has unit trace 
as it should. 

Therefore everything happens as if the state of Bob's subsystem actually was pj. Quantum mechanics 
does not claim that this is true, but also gives no way of showing that this realistic point of view is 
false. It's just a matter of belief. Bernard d'Espagnat calls Bob's reduced density matrix an "improper" 
mixture |p8[ . I also considered such situations in [Q, where I distinguished pure states, mixtures, and 
"compounds" (a compound is a mixture that has a unique decomposition into pure states if additional 



information is supplied). Here, I am slightly more general: Bob's reduced density matrix can be split in 
a unique way into other density matrices (not only pure states) if Alice reveals to him which result she 
got. 

The question is whether Bob can do that without Alice's help, and thereby find out which POVM 
she chose to perform. For this, one has to violate quantum mechanics in some way, such as cloning, 
as Nick Herbert originally proposed. Here, it is essential to assume that cloning an improper mixture 
(a "compound") means to clone each component separately, as if the state really was one of these 
components, with probability pj, and we merely ignored which one. 

Let us suppose that Bob can perform a trace-preserving nonlinear transformation 

Pj~>Pj, (4) 
on the unknown "true" states of his subsystem. Then his reduced density matrix evolves as 



Pb = Pj Pj -^Z^PjPj- ( 5 ) 

3 3 

Note that the result is not determined by pb alone, but the decomposition of ps into a definite set of 
pj is essential. In particular, the result depends on the choice made by Alice of a particular POVM, as 
Eq. (||) explicitly shows. 

If Alice chooses a different POVM, = 1, giving Bob states = TrA{pA s )/p s with probability p s , 

Bob would obtain, after the hypothetical nonlinear evolution, 



PB 



= ^PsPs -> ^2psPs- (6) 



Let us call the right hand sides of the last two equations p' and p" , respectively. In the generic case, they 
are not equal to each other and they can be distinguished statistically. If enough copies are supplied to 
Bob, he would be able to know which POVM Alice chose. 

Needless to say, the assumption of a nonlinear evolution of p violates quantum mechanics. Contrary- 
wise, if p evolves linearly, then p' = p" and no information can be transmitted in this way. Note that all 
the discussion is about the density matrix p. It is quite possible to have a nonlinear evolution of pure 
states which corresponds to a linear evolution of p. For example, the evolution 



p) ~" (o 



(7) 



is nonlinear in terms of pure states, because 



>-G) + 'G)- 

and the right hand side would give (a + (3) ( Q ) if the evolution were linear. 

On the other hand, the same evolution expressed in terms of density matrices appears as 

a b \ (\ 
b* 1 -a) ~* \0 
This is a linear operation, generated by a pair of Kraus matrices 



(9) 



1 °) and (° (10) 

In summary, Nick Herbert's erroneous paper was a spark that generated immense progress. There 
also are many wrong papers that have been published in reputable journals, some of them by renowned 
scientists. Their bad influence may last for years. For these, I decline all responsibility. I was not the 
referee of these papers and I could not protect the good reputation of their authors. 

I am grateful to Erika Andersson, Gilles Brassard, Dagmar Brufi, Chris Fuchs, GianCarlo Ghirardi, 
Nobu Imoto, Danny Terno, and Bill Wootters for many clarifying comments and for helping me to locate 
some of the references below. This work was supported by the Gerard Swope Fund and the Fund for 
Encouragement of Research. 
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